

The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 
Source: 

Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW:



Checker Version 3.0 

The Checker Version 3.0 application is a state-of-the-art Windows based software program
employing a logical and intuitive user-interface to check whether a sequence listing is in
compliance with format and content rules. Checker Version 3.0 works for sequence listings
generated for the original version of 37 CFR §§ 1.821 - 1.825 effective October 1, 1990 (old
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual
Property Organization (WIPO) Standard ST.25.

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K-compliant.
Checker allows public users to check sequence listings in Computer Readable Form
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO).
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence
listings, thus saving time and money.

Checker Version 3.0 can be downloaded from the USPTO website at the following address:

http://www.uspto.gov/w








Raw Sequence Listing Error Summary

SERIAL NUMBER:

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE

ERROR DETECTED          SUGGESTED CORRECTION

1   Wrapped Nucleics / Wrapped Aminos
    The number/text at the end of each line "wrapped" down to the next line. This may occur if
    your file was retrieved in a word processor after creating it. Please adjust your right
    margin to .3; this will prevent "wrapping."

2   Invalid Line Length
    The rules require that a line not exceed 72 characters in length; this includes white spaces.

3   Misaligned Amino Numbering
    The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
    use space characters, instead.

4   Non-ASCII
    The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules.
    Please ensure your subsequent submission is saved in ASCII text.

5   Variable Length
    Sequence(s) ___ contain n's or Xaa's representing more than one residue. Per Sequence Rules,
    each n or Xaa can only represent a single residue. Please present the maximum number of each
    residue having variable length and indicate in the <220>-<223> section that some may be
    missing.

6   PatentIn 2.0 "bug"
    A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino
    acid sequence(s) ___. Normally, PatentIn would automatically generate this section from the
    previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section
    to the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
    Artificial or Unknown sequences.

7   Skipped Sequences (OLD RULES)
    Sequence(s) ___ missing. If intentional, please insert the following lines for each skipped
    sequence:

    (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
    (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
    This sequence is intentionally skipped

    Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

8   Skipped Sequences (NEW RULES)
    Sequence(s) ___ missing. If intentional, please insert the following lines for each skipped
    sequence:

    <210> sequence id number
    <400> sequence id number
    000

9   Use of n's or Xaa's (NEW RULES)
    Use of n's and/or Xaa's has been detected in the Sequence Listing.
    Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
    In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa
    represents.

10  Invalid <213> Response
    Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence,
    or scientific name (Genus/species). <220>-<223> section is required when <213> response is
    Unknown or is Artificial Sequence.

11  Use of <220>
    Sequence(s) ___ missing the <220> "Feature" and associated numeric identifiers and responses.
    Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
    "Unknown." Please explain source of genetic material in <220> to <223> section.
    (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence
    Rules)

12  PatentIn 2.0 "bug"
    Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted
    file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw
    sequence listing). Instead, please use "File Manager" or any other manual means to copy file
    to floppy disk.

13  Misuse of n
    n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used
    to represent any value not specifically a nucleotide.



AMC/MH - Biotechnology Systems Branch - 08/21/2001 
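
Some of the format rules in the error summary (the 72-character line limit, ASCII-only text, and spaces instead of tab codes) are mechanical and can be pre-checked locally. The following is a minimal sketch in Python, not the Checker program itself. The function name `find_format_problems` and the message wording are hypothetical, and the sketch flags tab characters anywhere in the file, which is stricter than the form's rule about numbering lines.

```python
MAX_LINE_LENGTH = 72  # limit stated in the error summary; white space counts


def find_format_problems(text):
    """Return (line_number, message) pairs for simple format problems.

    Hypothetical helper for illustration only. It covers the line-length,
    ASCII, and tab rules and nothing else.
    """
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > MAX_LINE_LENGTH:
            problems.append((number, "line exceeds 72 characters"))
        if not line.isascii():
            problems.append((number, "non-ASCII character"))
        if "\t" in line:
            problems.append((number, "tab character; use spaces"))
    return problems


if __name__ == "__main__":
    sample = "<210> 1\n" + "a" * 73 + "\n1\t5\n"
    for number, message in find_format_problems(sample):
        print(number, message)
```

A file that passes these checks can still fail for content reasons, such as a missing <220>-<223> section, so this is only a first filter.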
RAW SEQUENCE LISTING

PATENT APPLICATION: US/09/941,947

DATE: 09/18/2001
TIME: 10:40:16

Input Set : A:\CL1903 US NA Seq Listing.txt
Output Set: N:\CRF3\09182001\I941947.raw

Does Not Comply
Corrected Diskette Needed

5 <110> APPLICANT: Brzostowicz, Patricia C.
6 Cheng, Qiong
7 DiCosimo, Deana J.
8 Koffas, Mattheos
9 Miller, Edward S. Jr.
10 Odom, J. Martin
11 Picataggio, Steve
12 Rouviere, Pierre E.
16 <120> TITLE OF INVENTION: CAROTENOID PRODUCTION FROM A SINGLE CARBON SOURCE
20 <130> FILE REFERENCE: CL1903 US NA
C--> 23 <140> CURRENT APPLICATION NUMBER: US/09/941,947
C--> 23 <141> CURRENT FILING DATE: 2001-08-29
23 <150> PRIOR APPLICATION NUMBER: 60/229,907
24 <151> PRIOR FILING DATE: 2000-09-01
26 <150> PRIOR APPLICATION NUMBER: 60/229,858
27 <151> PRIOR FILING DATE: 2000-09-01
29 <160> NUMBER OF SEQ ID NOS: 60
33 <170> SOFTWARE: Microsoft Office 97
37 <210> SEQ ID NO: 1
39 <211> LENGTH: 1311
41 <212> TYPE: DNA
43 <213> ORGANISM: Methylomonas 16a
47 <400> SEQUENCE: 1
48 gatgtggtca catggcccta tcacttaacg gctgatattc gattttgtca ttggtttttt 60
50 cttaacttta acttctacac gctcatgaac aaacctaaaa aagttgcaat actgacagca 120
52 ggcggcttgg cgccttgttt gaattccgca atcggtagtt tgatcgaacg ttataccgaa 180
54 atcgatccta gcatagaaat catttgctat cgcggcggtt ataaaggcct gttgctgggc 240
56 gattcttatc cagtaacggc cgaagtgcgt aaaaaggcgg gtgttctgca acgttttggc 300
58 ggttctgtga tcggcaacag ccgcgtcaaa ttgaccaatg tcaaagactg cgtgaaacgc 360
60 ggtttggtca aagagggtga agatccgcaa aaagtcgcgg ctgatcaatt ggttaaggat 420
62 ggtgtcgata ttctgcacac catcggcggc gatgatacca atacggcagc agcggatttg 480
64 gcagcattcc tggccagaaa taattacgga ctgaccgtca ttggtttacc taaaaccgtc 540
66 gataacgacg tatttccgat caagcaatca ctaggtgctt ggactgccgc cgagcaaggc 600
68 gcgcgttatt tcatgaacgt ggtggccgaa aacaacgcca acccacgcat gctgatcgta 660
70 cacgaagtga tgggccgtaa ctgcggctgg ctgaccgctg caaccgcgca ggaatatcgc 720
72 aaattactgg accgtgccga gtggttgccg gaattgggtt tgactcgtga atcttatgaa 780
74 gtgcacgcgg tattcgttcc ggaaatggcg atcgacctgg aagccgaagc caagcgcctg 840
76 cgcgaagtga tggacaaagt cgattgcgtc aacatcttcg tttccgaagg tgccggcgtc 900
78 gaagctatcg tcgcggaaat gcaggccaaa ggccaggaag tgccgcgcga tgcgttcggc 960
80 cacatcaaac tggatgcggt caaccctggt aaatggttcg gcgagcaatt cgcgcagatg 1020
82 ataggcgcgg aaaaaaccct ggtacaaaaa tcgggatact tcgcccgtgc ttctgcttcc 1080
84 aacgttgacg acatgcgttt gatcaaatcg tgcgccgact tggcggtcga gtgcgcgttc 1140
86 cgccgcgagt ctggcgtgat cggtcacgac gaagacaacg gcaacgtgtt gcgtgcgatc 1200
88 gagtttccgc gcatcaaggg cggcaaaccg ttcaatatcg acaccgactg gttcaatagc 1260
90 atgttgagcg aaatcggcca gcctaaaggc ggtaaagtcg aagtcagcca c 1311
93 <210> SEQ ID NO: 2
95 <211> LENGTH: 437
97 <212> TYPE: PRT
99 <213> ORGANISM: Methylomonas 16a
103 <400> SEQUENCE: 2
105 Asp Val Val Thr Trp Pro Tyr His Leu Thr Ala Asp Ile Arg Phe Cys
106 1               5                   10                  15
109 His Trp Phe Phe Leu Asn Phe Asn Phe Tyr Thr Leu Met Asn Lys Pro
110             20                  25                  30
113 Lys Lys Val Ala Ile Leu Thr Ala Gly Gly Leu Ala Pro Cys Leu Asn
114         35                  40                  45
117 Ser Ala Ile Gly Ser Leu Ile Glu Arg Tyr Thr Glu Ile Asp Pro Ser
118     50                  55                  60
121 Ile Glu Ile Ile Cys Tyr Arg Gly Gly Tyr Lys Gly Leu Leu Leu Gly
122 65                  70                  75                  80
125 Asp Ser Tyr Pro Val Thr Ala Glu Val Arg Lys Lys Ala Gly Val Leu
126                 85                  90                  95
129 Gln Arg Phe Gly Gly Ser Val Ile Gly Asn Ser Arg Val Lys Leu Thr
130             100                 105                 110
133 Asn Val Lys Asp Cys Val Lys Arg Gly Leu Val Lys Glu Gly Glu Asp
134         115                 120                 125
137 Pro Gln Lys Val Ala Ala Asp Gln Leu Val Lys Asp Gly Val Asp Ile
138     130                 135                 140
141 Leu His Thr Ile Gly Gly Asp Asp Thr Asn Thr Ala Ala Ala Asp Leu
142 145                 150                 155                 160
145 Ala Ala Phe Leu Ala Arg Asn Asn Tyr Gly Leu Thr Val Ile Gly Leu
146                 165                 170                 175
149 Pro Lys Thr Val Asp Asn Asp Val Phe Pro Ile Lys Gln Ser Leu Gly
150             180                 185                 190
153 Ala Trp Thr Ala Ala Glu Gln Gly Ala Arg Tyr Phe Met Asn Val Val
154         195                 200                 205
157 Ala Glu Asn Asn Ala Asn Pro Arg Met Leu Ile Val His Glu Val Met
158     210                 215                 220
161 Gly Arg Asn Cys Gly Trp Leu Thr Ala Ala Thr Ala Gln Glu Tyr Arg
162 225                 230                 235                 240
165 Lys Leu Leu Asp Arg Ala Glu Trp Leu Pro Glu Leu Gly Leu Thr Arg
166                 245                 250                 255
169 Glu Ser Tyr Glu Val His Ala Val Phe Val Pro Glu Met Ala Ile Asp
170             260                 265                 270
173 Leu Glu Ala Glu Ala Lys Arg Leu Arg Glu Val Met Asp Lys Val Asp
174         275                 280                 285
177 Cys Val Asn Ile Phe Val Ser Glu Gly Ala Gly Val Glu Ala Ile Val
178     290                 295                 300
181 Ala Glu Met Gln Ala Lys Gly Gln Glu Val Pro Arg Asp Ala Phe Gly
182 305                 310                 315                 320
185 His Ile Lys Leu Asp Ala Val Asn Pro Gly Lys Trp Phe Gly Glu Gln
186                 325                 330                 335
189 Phe Ala Gln Met Ile Gly Ala Glu Lys Thr Leu Val Gln Lys Ser Gly
190             340                 345                 350
193 Tyr Phe Ala Arg Ala Ser Ala Ser Asn Val Asp Asp Met Arg Leu Ile
194         355                 360                 365
197 Lys Ser Cys Ala Asp Leu Ala Val Glu Cys Ala Phe Arg Arg Glu Ser
198     370                 375                 380
201 Gly Val Ile Gly His Asp Glu Asp Asn Gly Asn Val Leu Arg Ala Ile
202 385                 390                 395                 400
205 Glu Phe Pro Arg Ile Lys Gly Gly Lys Pro Phe Asn Ile Asp Thr Asp
206                 405                 410                 415
209 Trp Phe Asn Ser Met Leu Ser Glu Ile Gly Gln Pro Lys Gly Gly Lys
210             420                 425                 430
213 Val Glu Val Ser His
214         435
217 <210> SEQ ID NO: 3
219 <211> LENGTH: 636
221 <212> TYPE: DNA
223 <213> ORGANISM: Methylomonas 16a
227 <400> SEQUENCE: 3
228 gaaaatacta tgtccgtcac catcaaagaa gtcatgacca cctcgcccgt tatgccggtc 60
230 atggtcatca atcatctgga acatgccgtc cctctggctc gcgcgctagt cgacggtggc 120
232 ttgaaagttt tggagatcac attgcgcacg ccggtggcac tggaatgtat ccgacgtatc 180
234 aaagccgaag taccggacgc catcgtcggc gcgggcacca tcatcaaccc tcataccttg 240
236 tatcaagcga ttgacgccgg tgcggaattc atcgtcagcc ccggcatcac cgaaaatcta 300
238 ctcaacgaag cgctagcatc cggcgtgcct atcctgcccg gcgtcatcac acccagcgag 360
240 gtcatgcgtt tattggaaaa aggcatcaat gcgatgaaat tctttccggc tgaagccgcc 420
242 ggcggcatac cgatgctgaa atcccttggc ggccccttgc cgcaagtcac cttctgtccg 480
244 accggcggcg tcaatcccaa aaacgcgccc gaatatctgg cattgaaaaa tgtcgcctgc 540
246 gtcggcggct cctggatggc gccggccgat ctggtagatg ccgaagactg ggcggaaatc 600
248 acgcggcggg cgagcgaggc cgcggcattg aaaaaa 636
251 <210> SEQ ID NO: 4
253 <211> LENGTH: 212
255 <212> TYPE: PRT
257 <213> ORGANISM: Methylomonas 16a
261 <400> SEQUENCE: 4
263 Glu Asn Thr Met Ser Val Thr Ile Lys Glu Val Met Thr Thr Ser Pro
264 1               5                   10                  15
267 Val Met Pro Val Met Val Ile Asn His Leu Glu His Ala Val Pro Leu
268             20                  25                  30
271 Ala Arg Ala Leu Val Asp Gly Gly Leu Lys Val Leu Glu Ile Thr Leu
272         35                  40                  45
275 Arg Thr Pro Val Ala Leu Glu Cys Ile Arg Arg Ile Lys Ala Glu Val
276     50                  55                  60
279 Pro Asp Ala Ile Val Gly Ala Gly Thr Ile Ile Asn Pro His Thr Leu
280 65                  70                  75                  80
283 Tyr Gln Ala Ile Asp Ala Gly Ala Glu Phe Ile Val Ser Pro Gly Ile
284                 85                  90                  95
287 Thr Glu Asn Leu Leu Asn Glu Ala Leu Ala Ser Gly Val Pro Ile Leu
288             100                 105                 110
291 Pro Gly Val Ile Thr Pro Ser Glu Val Met Arg Leu Leu Glu Lys Gly
292         115                 120                 125
295 Ile Asn Ala Met Lys Phe Phe Pro Ala Glu Ala Ala Gly Gly Ile Pro
296     130                 135                 140

299 Met Leu Lys Ser Leu Gly Gly Pro Leu Pro Gln Val Thr Phe Cys Pro
300 145                 150                 155                 160
303 Thr Gly Gly Val Asn Pro Lys Asn Ala Pro Glu Tyr Leu Ala Leu Lys
304                 165                 170                 175
307 Asn Val Ala Cys Val Gly Gly Ser Trp Met Ala Pro Ala Asp Leu Val
308             180                 185                 190
311 Asp Ala Glu Asp Trp Ala Glu Ile Thr Arg Arg Ala Ser Glu Ala Ala
312         195                 200                 205
315 Ala Leu Lys Lys
316     210
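
The amino acid rows in this listing carry 16 residues per line, with position numbers under the first and every fifth residue. A short routine can produce that layout with space characters only, which avoids the misaligned-numbering error. The sketch below is an illustration under stated assumptions, not PatentIn's own algorithm: the name `format_protein` is hypothetical, residues are assumed to be three-letter codes, and numbering always starts at 1.

```python
def format_protein(residues, per_line=16):
    """Return listing lines: each residue row followed by its numbering row.

    Assumes three-letter residue codes, so every residue occupies a
    four-character column (code plus one space). Only spaces are used
    for alignment; no tab characters are emitted.
    """
    lines = []
    for start in range(0, len(residues), per_line):
        row = residues[start:start + per_line]
        lines.append(" ".join(row))
        marks = [" "] * (4 * len(row))
        for offset in range(len(row)):
            position = start + offset + 1
            if position == 1 or position % 5 == 0:
                label = str(position)
                column = 4 * offset
                # Write the number starting under the residue it labels.
                marks[column:column + len(label)] = label
        lines.append("".join(marks).rstrip())
    return lines


if __name__ == "__main__":
    for line in format_protein(["Met", "Leu", "Lys", "Ser", "Leu", "Gly"]):
        print(line)
```

Placing each number in a fixed column, rather than joining numbers with separators, keeps the labels aligned regardless of how many digits they have.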

319 <210> SEQ ID NO: 5 

321 <211> LENGTH: 1860 

323 <212> TYPE: DNA 

325 <213> ORGANISM: Methylomonas 16a 

329 <400> SEQUENCE: 5

330 atgaaactga ccaccgacta tcccttgctt aaaaacatcc acacgccggc ggacatacgc 60 
332 gcgctgtcca aggaccagct ccagcaactg gctgacgagg tgcgcggcta tctgacccac 120 
334 acggtcagca tttccggcgg ccattttgcg gccggcctcg gcaccgtgga actgaccgtg 180 
336 gccttgcatt atgtgttcaa tacccccgtc gatcagttgg tctgggacgt gggccatcag 240 
338 gcctatccgc acaagattct gaccggtcgc aaggagcgca tgccgaccat tcgcaccctg 300 
340 ggcggggtgt cagcctttcc ggcgcgggac gagagcgaat acgatgcctt cggcgtcggc 360 
342 cattccagca cctcgatcag cgcggcactg ggcatggcca ttgcgtcgca gctgcgcggc 420 
344 gaagacaaga agatggtagc catcatcggc gacggttcca tcaccggcgg catggcctat 480
346 gaggcgatga atcatgccgg cgatgtgaat gccaacctgc tggtgatctt gaacgacaac 540 
348 gatatgtcga tctcgccgcc ggtcggggcg atgaacaatt atctgaccaa ggtgttgtcg 600 
350 agcaagtttt attcgtcggt gcgggaagag agcaagaaag ctctggccaa gatgccgtcg 660 
352 gtgtgggaac tggcgcgcaa gaccgaggaa cacgtgaagg gcatgatcgt gcccggtacc 720 
354 ttgttcgagg aattgggctt caattatttc ggcccgatcg acggccatga tgtcgagatg 780 
356 ctggtgtcga ccctggaaaa tctgaaggat ttgaccgggc cggtattcct gcatgtggtg 840
358 accaagaagg gcaaaggcta tgcgccagcc gagaaagacc cgttggccta ccatggcgtg 900 
360 ccggctttcg atccgaccaa ggatttcctg cccaaggcgg cgccgtcgcc gcatccgacc 960 
362 tataccgagg tgttcggccg ctggctgtgc gacatggcgg ctcaagacga gcgcttgctg 1020 
364 ggcatcacgc cggcgatgcg cgaaggctct ggtttggtgg aattctcaca gaaatttccg 1080 
366 aatcgctatt tcgatgtcgc catcgccgag cagcatgcgg tgaccttggc cgccggccag 1140 
368 gcctgccagg gcgccaagcc ggtggtggcg atttattcca ccttcctgca acgcggttac 1200 
370 gatcagttga tccacgacgt ggccttgcag aacttagata tgctctttgc actggatcgt 1260 
372 gccggcttgg tcggcccgga tggaccgacc catgctggcg cctttgatta cagctacatg 1320 
374 cgctgtattc cgaacatgct gatcatggct ccagccgacg agaacgagtg caggcagatg 1380 
376 ctgaccaccg gcttccaaca ccatggcccg gcttcggtgc gctatccgcg cggcaaaggg 1440 
378 cccggggcgg caatcgatcc gaccctgacc gcgctggaga tcggcaaggc cgaagtcaga 1500 
380 caccacggca gccgcatcgc cattctggcc tggggcagca tggtcacgcc tgccgtcgaa 1560 
382 gccggcaagc agctgggcgc gacggtggtg aacatgcgtt tcgtcaagcc gttcgatcaa 1620 
384 gccttggtgc tggaattggc caggacgcac gatgtgttcg tcaccgtcga ggaaaacgtc 1680 
386 atcgccggcg gcgctggcag tgcgatcaac accttcctgc aggcgcagaa ggtgctgatg 1740 
388 ccggtctgca acatcggcct gcccgaccgc ttcgtcgagc aaggtagtcg cgaggaattg 1800 
390 ctcagcctgg tcggcctcga cagcaagggc atcctcgcca ccatcgaaca gttttgcgct 1860 
393 <210> SEQ ID NO: 6

395 <211> LENGTH: 620 

397 <212> TYPE: PRT 






<210> SEQ ID NO 41 
<211> LENGTH: 38 
<212> TYPE: DNA 

<213> ORGANISM ^Artificial Sequence 
<220> FEATURE : 



<223> OTHER INFORMATION": 
<400> SEQUENCE : 41 

agcagctagc ggaggaataa accatgagcg catttctc 38



<210> SEQ ID NO 42 
<211> LENGTH: 26 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequ£ 
<220> FEATURE: 
<223> OTHER INFORMATI 
<400> SEQUENCE: 42 




gactagtcac gacctgctcg aacgac 26






The type of errors shown exist throughout 
the Sequence Listing. Please check subsequent 
sequences for similar errors. 
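
One way to look for the same missing-feature error in the remaining sequences is to scan the whole listing file. The sketch below is an illustration only, not the Checker program. It assumes a new-rules file with numeric identifiers at the start of each line (no STIC line-number prefixes) and optional upper-case English headers ending in a colon. The names `missing_features`, `IDENTIFIER`, and `HEADER` are hypothetical. It reports each SEQ ID whose <213> response is Artificial Sequence or Unknown but whose <220> line is absent or whose <223> response is empty.

```python
import re

# A numeric identifier such as <213>, followed by its response text.
IDENTIFIER = re.compile(r"^\s*<(\d{3})>\s*(.*)$")
# Optional English header such as "ORGANISM:" (assumed to be upper case).
HEADER = re.compile(r"^[A-Z][A-Z ]*:\s*")
NEEDS_FEATURE = {"artificial sequence", "unknown"}


def missing_features(listing):
    """Return SEQ IDs that need a <220>-<223> section but lack one."""
    records = {}    # SEQ ID -> {identifier: response text}
    current = None
    for line in listing.splitlines():
        match = IDENTIFIER.match(line)
        if match is None:
            continue  # sequence data or blank line
        code = match.group(1)
        response = HEADER.sub("", match.group(2)).strip()
        if code == "210":
            current = response
            records[current] = {}
        elif current is not None:
            # Join repeated identifiers (for example several <223> lines).
            fields = records[current]
            fields[code] = fields.get(code, "") + response
    flagged = []
    for seq_id, fields in records.items():
        if fields.get("213", "").lower() in NEEDS_FEATURE:
            if "220" not in fields or not fields.get("223"):
                flagged.append(seq_id)
    return flagged
```

The check treats an empty <223> response the same as a missing one, on the reading that the rule asks for an explanation of the source of the genetic material rather than just the identifier.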
VERIFICATION SUMMARY

PATENT APPLICATION: US/09/941,947

DATE: 09/18/2001
TIME: 10:40:17

Input Set : A:\CL1903 US NA Seq Listing.txt
Output Set: N:\CRF3\09182001\I941947.raw



L:23 M:270 C: Current Application Number differs, Replaced Current Application No
L:23 M:271 C: Current Filing Date differs, Replaced Current Filing Date
L:3057 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:41
L:3061 M:258 W: Mandatory Feature missing, <220> FEATURE:
L:3061 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION:
L:3071 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:42
L:3075 M:258 W: Mandatory Feature missing, <220> FEATURE:
L:3075 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION:
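
When a summary like this runs to many entries, it can help to group them by message code. The report does not define its fields, so the sketch below rests on assumptions: `L:` is read as a line number in the raw listing, `M:` as a message code, and the single letter before the text as a flag. The names `parse_summary` and `count_by_code` are hypothetical.

```python
import re
from collections import Counter

# Assumed entry layout: "L:<line> M:<code> <flag>: <text>".
ENTRY = re.compile(r"^L:(\d+)\s+M:(\d+)\s+([A-Z]):\s*(.*)$")


def parse_summary(text):
    """Return one dict per summary line that matches the assumed layout."""
    entries = []
    for line in text.splitlines():
        match = ENTRY.match(line.strip())
        if match:
            entries.append({
                "line": int(match.group(1)),
                "code": int(match.group(2)),
                "flag": match.group(3),
                "text": match.group(4),
            })
    return entries


def count_by_code(entries):
    """Count how many entries carry each message code."""
    return Counter(entry["code"] for entry in entries)
```

Lines that do not match the layout are skipped rather than reported, so a damaged entry has to be repaired by hand before it is counted.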